SARKOZY'S THEOREM FOR P-INTERSECTIVE POLYNOMIALS 



ALEX RICE 



Abstract. We define a necessary and sufiicient condition on a polynomial ft, £ Z[a;] to guarantee that every 
set of natural numbers of positive upper density contains a nonzero difference of the form h{p) for some 
prime p. Moreover, we establish a quantitative estimate on the size of the largest subset of {1, 2, . . . , A'^} 
which lacks the desired arithmetic structure, showing that if deg(/i) = fc, then the density of such a set is at 
most a constant times (logA'^)"'^ for any c < l/(2fc — 2). We also discuss how an improved version of this 
result for k = 2 and a relative version in the primes can be obtained with some additional known methods. 



1. Introduction 
1.1. Background. A set A C N is said to have positive upper density if 

hmsup ^ > 0, 

where [1,A^] denotes {1, 2, . . . , iV}. In the late 1970s, Sarkozy and Furstenberg independently confirmed 
a conjecture of Lovasz that any set of natural numbers of positive upper density necessarily contains two 
elements which differ by a perfect square. Furstenberg [2] used ergodic theory and obtained a purely quali- 
tative result, proving the conjecture exactly as stated above. Sarkozy, however, employed a Fourier analytic 
density increment strategy, inspired by Roth's proof of the analogous conjecture for three-term arithmetic 
progressions |17) . to prove the following quantitative strengthening. 

Theorem A (Sarkozy, [l9j). If AC[1,N] and A - A for all n e N, then 

\A\ /(loglogiV)2\i/3 



iv V 



logiV 



In this and the following theorems. A — A denotes the difference set {a — a' : a, a' € A}, the symbol ^ 
denotes "less than a constant times" , and we implicitly assume that N is large enough to make the right hand 
side of the inequalities defined and positive. An extensive literature has been developed on improvements 
and extensions of Theorem A, for which the reader may refer to [15] , [I] , [21] , [12] , [13] , [8] , and [6] . In the 
same series of papers, Sarkozy answered a similar question of Erdos concerning shifted primes. 

Theorem B (Sarkozy, [20,). If AQ [l^N] and p — 1 ^ A — A for all primes p, then 

\A\ (log log log A^)^ log log log log jV 
^ ' N (loglogA)2 

The bounds in Theorem B have been improved, first by Lucier |llj and later by Ruzsa and Sanders [18) . 
who replaced (P with \A\/N < 

A natural generalization of Theorem A is the replacement of the squares with the image of a more general 
polynomial. However, to hope for such a result for a given polynomial h e Z[x], it is clearly necessary that 
h has a root modulo q for every q € N, as otherwise there would be a set qN of positive density with no 
differences in the image oi h. It follows from a theorem of Kamae and Mendes France [7] that this condition 
is also sufficient, in a qualitative sense, and in this case we say that h is an intersective polynomial. 
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Examples of intersective polynomials include any polynomial with an integer root and any polynomial 
with two rational roots with coprime denominators. However, there are also intersective polynomials with 
no rational roots, for example (x^ — 19){x^ + a; + 1). The best current bounds for this most general setting 
are essentially due to Lucier, who successfully adapted the density increment procedure by utilizing p-adic 
roots and allowing the polynomial to change at each step of the iteration. 

Theorem C (Lucier, |12j). Suppose h € Z[x] is an intersective polynomial of degree k > 2 with positive 
leading term. If A C [1,N] and h{n) ^ A — A for all n €N with h{n) > 0, then 

\A\ r{\og\og NY \^/(k^i) h ifk = 2 

Iv^y logiV / ' ^^[2 ifk>2' 

where the implied constant depends only on h. 

In |16j , the author made an extremely mild improvement to Theorem C, showing that one can in fact 
take /i = 1. By the symmetry of difference sets. Theorem C and all the following theorems clearly imply 
the analogous results for the negative values of a polynomial with negative leading term. Recently, Hamel, 
Lyall, and the author utilized Lucier's techniques in extending the best known bound on the size of a set 
with no square differences, due originally to Pintz, Steiger, and Szemeredi [T5] and extended to fc*^ powers by 
Balog, Pelikan, Pintz, and Szemeredi [T], to all intersective polynomials of degree 2, which is to say quadratic 
polynomials which have two rational roots with coprime denominators. 

Theorem D (Hamel, Lyall, Rice, 'F ). Suppose h G Zlx] is an intersective quadratic polynomial with positive 
leading term. If A Q [1, N] and hin) ^ A — A for all n Cz N with hin) > 0, then 

1^1 <C (log iV)^'^'°^'°8'°^'°^^ 
for any c < l/log3, where the implied constant depends only on h and c. 

Some work has also been done to combine extensions of Theorem A with Theorem B. Li and Pan |10j 
established the following quantitative result. 

Theorem E (Li, Pan, [lOj). Suppose h e has positive leading term and h{l) =0. If A Q [1,.^] and 
hip) ^ A — A for all primes p with h(jp) > 0, then 

\A\ 

^ < 1/ log log log iV. 

Additionally, Le and Li-Pan applied transference principles inspired by work of Green and Tao ([3], [3], 
[5]) to prove analogs of Theorems C and E, respectively, for dense subsets of the primes, which we denote 
by P. We state the qualitative results below. 

Theorem F (Le, [5]). If h ^ Z[a;] is an intersective polynomial with positive leading term, AQV, and 
then there exist a,a' G A and n S N with a — a' = h{n) > 0. 

If A QV meets condition ([2|), we say that it has positive relative upper density in the primes. 

Theorem G (Li, Pan, [10 ). If h ^ Z[x] has positive leading term, h{\) = 0, and AQV has positive relative 
upper density, then there exist a, a' G A and p Cz V with a — a' = h(p) > 0. 

1.2. Main result of this paper. Just as there are intersective polynomials without integer roots, it seems 
natural to think that a result like Theorem E should hold for a larger class of polynomials. A moments 
consideration indicates that the correct analog to the intersective condition on a polynomial h when looking 
for differences of the form h{p) is to insist that h not only has a root modulo q for every g G N, but has a 
root at a congruence class that admits infinitely many primes, leading to the following definition. 
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Definition 1. A polynomial h £ 'Zlx] is called V -intersective if for every q G N, there exists r G Z such 
that (r, q) — 1 and g | Equivalently, for every p € V, there exists Zp G Zp, where Zp denotes the p-adic 

integers, such that h{zp) — and Zp ^ mod p. 

Remark. After the initial version of this paper was posted on arXiv server, the author learned that in [9j, a 
survery on problems and results on intersective sets, Thai Hoang Le independently posed the same question 
and arrived at the same definition, which he coined intersective polynomials of the second kind. Even later, 
the author learned that these polynomials were considered by Wierdl [23j in his thesis, where he called them 
intersective along the primes. 

Examples of P-intcrsective polynomials include any polynomial with a root at 1 or —1, any polynomial 
with two rational roots a/b and c/d such that {ab, cd) — 1, and presumably lots more. The necessity of this 
condition is almost as clear as that of the original intersective condition. To exhibit this, suppose we have 
h G Z[x] and q £ N such that the only roots of h modulo q share common factors with q. In particular, 
there are finitely many primes p such that q \ h{p). Letting m = max{h{p)/q : p £ V, q | if such 

primes exist and m = otherwise, we see that q{m + 1)N is a set of positive upper density which contains 
no differences of the form h{p). Wierdl [33] observed in his thesis that one can again deduce the sufficiency 
of this condition, in a qualitative sense, from the aforementioned theorem of Kamae and Mendes France [7], 
and here we borrow heavily from [T^], [TH], [13], and [TU] to establish the following quantitative result. 

Theorem 1. Suppose h £ Z[x] is a V -intersective polynomial of degree k > 2 with positive leading term. If 
A C [1, N] and h{p) ^ A - A for all p € V with h{p) > 0, then 



but here we stick to the slightly less precise version for a more pleasing exposition. 

1.3. Additional Results. In addition to Theorem [TJ one can conclude the following analogs of previous 
results from the estimates we obtain along the way. 

Theorem 2. Suppose h G Z[a;] is a V -intersective quadratic polynomial with positive leading term. If 
Ac [1,N] and h{p) ^ A - A for all p e V with h{p) > 0, then 



for any c < 1/2 log 3, where the implied constant depends only on h and c. 

Just as in the traditional setting, the T'-intersective condition is greatly simplified when restricted to degree 
2, as a quadratic polynomial is P-intersective if and only if it has rational roots a/b and c/d with {ab, cd) = 1. 

Theorem 3. If h £ Z[a;] is a V -intersective polynomial with positive leading term and A Q V has positive 
relative upper density, then there exist a, a' G A and p V with a — a' ^ h{p) > 0. 

Equipped with the techniques and results of this paper, the modifications of the arguments in [5] and [5] 
required to obtain Theorems [2] and [S] respectively, are so minor that we do not provide the full details here. 
Alternatively, we discuss the required adaptations informally in Appendix [B] 




In fact, with a few careful modifications one can sharpen ([3]) to 

\A\ ( (log log Nf{\og log log Nf^ \ i/(2fc-2) 




— cloglogloglogA'^ 
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2. Preliminaries 



To begin our effort to prove Theorem [TJ we fix a T'-intersective polynomial h of degree k > 2 with positive 
leading term and an arbitrary e > 0, and we set s = 2^^ + 6 and K = 2^°'=. For our current purposes, we only 
use that s > 9, but the choice also plays a role in our discussion of Theorems [5] and [3] We also fix a natural 
number N which, at the expense of constants in Theorem [1] we are always free to insist is sufficiently large 
with respect to h and e. For convenience, we take this as a perpetual hypothesis and abstain from including 
it further. We use the letters C and c to denote appropriately large or small positive constants, which will 
change from line to line and we allow, along with any implied constants, to depend on h and e. 

Further, we fix a set A C [l,iV] with \A\/N = (5 > and set Q{6) = exp (^Co(5"(*^+'"^' j for a constant Co- 

2.1. Auxiliary Polynomials and Related Definitions. We apply the modified density increment strat- 
egy described in [12j, which allows for the polynomial to change at each stage of the iteration. The following 
definitions describe all of the polynomials that we could potentially encounter, as well as several other objects 
that will appear in the argument. 

Definition 2. For each p G V, we fix Zp S Zp with h{zp) = and Zp ^ mod p. By reducing and applying 
the Chinese Remainder Theorem, the choices of Zp determine, for each natural number d, a unique integer 
rd S (— d, 0], which consequently satisfies d \ h(rd) and {rd,d) = 1. 

We define the function A on N by letting — p™, where m is the multiplicity of Zp as a root of /i in Zp, 
and then extending it to be completely multiplicative. 

For each d G N, we define the auxiliary polynomial hd by 

hd{x) = h{rd + dx)/\{d). 

Lucier observed in Lemma 21 of |12| that each hd has integer coefficients, and it is important to note that 
the leading coefficients grow with d at least as quickly, up to a constant, as the other coefficients. 

Further, we let A,; = {x e N : j'^ + e T'}, and for i € N we define 

Hd - Hd{L) = {x e N : < hd{x) < L/s] 

and 

Md = Md{L) = {Llsbdf'\ 
where hd is the leading coefficient of /i^, noting that 
(4) |[l,Md]AFrf| =0(1), 

where A denotes the symmetric difference. We also define a function Vd on Z by 

Md) 

Vd\x) = — ^log(rrf + da;)lA^(a;), 
where <f> is the Euler totient function, and for a set B Q [1, we define 

nd{B)=nd{B,L)= J2 ^B{x)lBix + hdiy))My)- 

In the definitions above, L should always be replaced with the size of the appropriate ambient interval. 

2.2. Counting Primes in Arithmetic Progressions. For X, a, g G N, we define 

p=a mod q 

The classical estimates on i^{X, a, q) come from the famous Siegel-Walfisz Theorem, which can be found for 
example in Corollary 11.19 of [14j. 
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Lemma 1 (Siegel-Walfisz Theorem). If q < (iogX)^ and {a,q) = 1, then 

^{X, a, q) = X/m + 0{Xe-^-^) 

for some constant c = c{D) > 0. 

Ruzsa and Sanders [18] estabhshed asymptotics for ipiXi l) for certain moduh q beyond the hmitations 
of Lemma [T] by exploiting a dichotomy based on exceptional zeros, or lack thereof, of Dirichlet L- functions. 
In particular, the following result follows from their work. 

Lemma 2. If Q{5) < e'^i^'°s j^^^ ^ sufficiently small constant c\ ~ ci(fc) > 0, then there exist qo < Q{6Y^ 
and p e [1/2, 1) with (1 — p)^^ <C qo such that 

where x is a Dirichlet character modulo qo, provided X > N^/^'^^ ^ | {a,q) = 1, and q < {qoQiS))'^^ ■ 

Lemma [2] is a purpose-built special case of Proposition 4.7 of [TH], which in the language of that paper can 
be deduced by considering the pair {Q{Sy^^ , Q{5)^^), where qo is the modulus of the exceptional Dirichlet 
character if the pair is exceptional and qo — I ii the pair is unexceptional. 

As remarked in the proof of Proposition 5.3 of [18], the asymptotic in Lemma [2] implies that under the 
hypotheses we have 

(6) ^{X, a, g) » - > (1 - p)X/(b{q) » ^ 



2.3. A Uniform Estimate on TZd- Wc obtain Theorem[T]as a consequence of the following, stronger result, 
which says that the number of solutions to a — a' = h{p) > with a, a' € A, p € V, has the correct order 
of magnitude. In addition, we obtain this estimate uniformly in d for a range of auxiliary polynomials hd, 
which serves as the primary input required to apply the techniques of [8 and conclude Theorem (3] 

Theorem 4. There exists a constant C depending only on h, e, and Co such that 

nd{A) > exp(-C(5-('=+'-i))7VM<i 
for all d < max{logiV,Q((5)}, provided 6 > C(log A^)-i/2(fe+e-i) _ 

3. Main Iteration Lemma: Deducing Theorem [4| 
We now make the assumption that 



(7) Q{S) < e''^^ 

for a sufficiently small constant ci > 0, which is implied by the condition S > Co(log Af)^^/^(''+'^^^)/ci, and 
we fix p and go yielded by Lemma [2] Also, we set 7 = fc + e/2 and for d, L e N we define 

*d = = <j){d)i;idMd,rd,d)/d, 

noting by ((6|) that for appropriate d and L we have 

(8) *d > (1 - p)Md/ > Md/qo- 

We deduce Theorem |4| from the following iteration lemma, which states that a set which is deficient in the 
desired arithmetic structure spawns a new, significantly denser subset of a slightly smaller interval with an 
inherited deficiency in the structure associated to an appropriate auxiliary polynomial. 

Lemma 3. Suppose B C [1,£], \B\/L = cr > S, and L > ^/N. If qo \ d, d/qo < maxjlog A^, (5((5)}^, and 

UdiB) < a^L^d/8, 

then there exists q < ct^^ and B' C [1,L'] with L' > cttC^'+i)!., UgdiB') < UdiB), and 

\B'\ > {cr + ca')L'. 
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The following proposition exhibits the aforementioned inheritance of deficiency in arithmetic structure, 
and is essential to the deduction of Theorem |4] from Lemma [3] as well as the proof of Lemma [3] itself. 

Proposition 4. If B C [1,L] and B' C {£ e [1, L'] : x+£X{q) E B} for some xeZ,qen, and L' < L/X{q), 
then for any d G N, 

n^diB') < UdiB). 

Proof Suppose B C [1, L], B' C {£ e [1, L'] : x + eX{q) G B}, L' < L/\{q), and 

T'l 0' h ( \ ^(^gd + qdn) 

for e i?', n e Agd- Recalling that r^^ = mod d, there is an integer m such that rqd = r^ + mci, so 

, _ h{rd + d(m + grt)) _ hjim + gn) 
A(g)A(d) ~ \{q) ' 

and therefore 

< ha{m + gn) = \{q)i - A(g)f = (x + A(g)£) - (x + A(g)f ) < \{q)L'/s < L/s. 
Moreover, we know that + d(m + qn) = rqd + qdn G P, so m + G A^, and the result follows. □ 

3.1. Proof of Theorem [4l Fixing d < maxjlog A^, (5(5)} and partitioning [1,A^] into arithmetic progres- 
sions of step size X{qo) and length between N/2X{qo) and N/X{qo), the pigeonhole principle guarantees the 
existence of an arithmetic progression P = {x + £X{qo) : 1 < ^ < Nq} such that N/2X{qo) < Nq < N/X{qo) 
and \Ar]P\/No > S. 

This allows us to define Ao C [1, Nq] by Aq — {£ E [1, No] : x + iX{qQ) G A}, which consequently satisfies 

\Ao\/No^So>d, No>N/Q{S)^'''', and 7^,„<J(Ao) < 7^d(A), 
where the last fact follows from Proposition |4l 

We then iteratively apply Lemma[3l which yields, for each m, a set Am C [1, A^ni] with |A„i| ~ 5mNm and 

(9) ndMn)<nd{A) 

satisfying 

(10) Nm>{c6f'^No, <5„, >(5,„_i+c57„_i, q^\dm, and d„,/go < (c5)"'""d 
as long as 

(11) Nm > Va^, dm/qo < max{logiV,g(5)}2, 
and 

(12) TZdAAm) <SlNm,^d„JS. 

By (ITU|) . we see that the density 6m. would surpass 1, and hence PT|) or (IT^ must fail, with 

(13) TO = 

However, if Co is sufficently large then (US]) implies (cJ)"*^™ < Q{6), hence A^^ > N/Q{6) > VN and 
dm/go < QiS)d < max{\og N,Q{6)Y, so (HU holds. Further, we see by © and (HOI) that 

6lNm'fd^ > (c6f"'NoMd/qo > exp(-Cr('=+^-i))A^Afrf, 

so if TZd{A) < exp(-C(5"(''+'~i))A^Md for a sufficiently large constant C, then by (O we have that (HI]) also 
holds. This yields a contradiction, and the theorem follows. 
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4. Density Increment Strategy: Deducing Lemma [3] 



4.1. Fourier Analysis on Z. We embed our finite sets in Z, on which we utilize the discrete Fourier 
transform. Specifically, for a function F : Z — > C with finite support, we define : T — > C, where T denotes 
the circle parameterized by the interval [0,1] with and 1 identified, by 



F{a) = ^F(a;)e-2'^"". 



Given L £ N and a set i? C [1,L] with \B\ — aL, we examine the Fourier analytic behavior of B by 
considering the balance function, fs, defined by 

/-B = 1b - <^^[l,L]- 

4.2. The Circle Method. We analyze the behavior of Jb using the Hardy-Littlewood circle method, 
decomposing the frequency space into two pieces: the points on the circle which are close to rationals with 
small denominator, and those which are not. 

Definition 3. Given L e N and 77 > 0, we define, for each q E 'N and a e [1, 9], 

M,/, = M,/,(L,?7) = |aeT:|a-^|<-i^| and M, = |J M, 



/q- 

(a,q) = l 



We then define 9Jt, the major arcs, by 

q=l 

and m, the minor arcs, by 

m = T \ 97t. 

4.3. Concentration and Density Increment Lemmas. As usual, the philosophy behind the argu- 
ment is that a deficiency in the desired arithmetic structure from a set B represents nonrandom behavior, 
which should be detected in the Fourier analytic behavior of B. Specifically, we follow the approach of Lyall 
and Magyar [13] to locate one small denominator q such that /s has concentration around rationals with 
denominator q, then use that information to find a long arithmetic progression on which B has increased 
density. 

Lemma 5 (L^ Concentration). Suppose B C [1,L], \B\/L ^ cr > S, and L > \/N, and let rj — cia for a 
sufficiently small constant 02 > 0. Suppose further that 

qo I d, d/qo < max{log TV, g((5)}^ and 7^<i(S) < 1-^^/8. 

If \B n (i/9,8i/9)| > 30-L/4, then there exists q < r]-^ such that 



I 



|/B(a)pda»aT+iL. 



We now invoke a variation of the usual density increment. Specifically, we quote a result which follows 
from Proposition 7.2 of !l8l. 

Lemma 6 (Density Increment). Suppose B C [\,L\ with \B\ = crL and let rj — C2cr. // 

|/^(a)|2da > wa^i, 

then there exists an arithmetic progression 

P = {x + t\{q) :l<i<L'} 

with 

L/\{q)> L' mxa{rf ,uj(j}L/\{q) and \Ar\ P\/ L' > a + uja / 4,. 
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4.4. Proof of Lemma [3l Suppose B C [l,L] meets all the hypotheses of the lemma. 

If |Sn (L/9,8L/9)| < 30-L/4, then 

max{|Bn [l,L/9]|,|Bn [8L/9]|} > aL/8. 
In other words, B has density at least 9ct/8 on one of these intervals. 

Otherwise, Lemmas [5] and [6] apply, so in either case there exists q < rj~'^ and an arithmetic progression 
P = {x + i\{q) ■.1<1<L'} with 

i/A(g) > L' >(7''i/A(g)) > a^C^'+i^L and \B D P\/ L' > a + ca'' . 

This allows us to define a new set B' C [1, L'] by 

B' = {£e -.x + iXiq) e B}, 

which by Proposition 2] satisfies TZqd{B') < TZd{B), as required. □ 



4.5. Proof of Lemma [5j Suppose B C \B\/L = a > S, and L > \/N. Let rj = C2(t, and suppose 

further that qo \ d and d/qo < maxjlog A^, Q{S)}^. For the remainder of the proof, we keep this d fixed and 
omit it from the notations Hd, Md, Vd, Ti-d, and ^'d defined in Sections 2 and 3. 

Since hd{H) C [l,i/9), we see that 

X! .fB{x)fB{x + hd{y))v{y) = ^ Ib{x)Ib{x + hd(jj))v{y) - c ^ '^b{x)1[ix]{x + hd{y))vljj) 

xei x£Z kgz 

ySH yeH yGH 

~'^Y1 ~ hd{y))lB{x)v{y) + cr^ X! + hd{y))v{y) 

<7^(B) + (a2L-r7(|Bn [i,8i/9)| + |sn(L/9,L]|)) ^ Ky)- 



By (jlj we have that 

5^%) = * + 0(logi), 

so if |Bn (L/9,8L/9)| > 3ctL/4 and 7^(B) < ct2L^'/8, we have that 
(14) fB{x).fB{x + hd{y)Hy) < 

x£Z 

yen 

One can easily check using (U) and orthogonality of characters that 



(15) 



V fB{x)fB{x + hd{y)Hy) = / |i^(a)p5M(a)da + 0{L\ogL), 

..^n, Jo 



where 

X 

S-xCa) -^i^(a;)e2"''<'(^)". 

x=l 

Combining ([M)) and (fT5|) . we have 

(16) / |/^(a)|2|5A,(«)|da>a2L^'/16. 

Jo 

It follows from Lemma [2J an observation of Lucier on auxiliary polynomials, and Theorem 4.1 of [lOj that 

(17) \SM{a)\ < g-i/^* for ah aeMgCM, 



and 

(18) \SM{a)\ < < cr*/32 for all a € m, 

provided we choose C2 < 1/32C. We discuss these estimates in more detail in Section [SJ 

From ([TS]) and Plancherel's Identity, we have 

\h{a)\^\SM{a)\da<a^L'^/32, 



which together with (IT5|) yields 

(19) / \fB{a)\^\SM{a)\da>a^L^/32. 

By (H?]) and we have 

fj^i^ (y 5-1/7^ jnax / |/^(a)|2da<a-T+i max / |/^(a)|2da, 

and the lemma follows. □ 

5. Major and Minor Arc Estimates: Proof of (fT7j) and (fT5|) 

We remain in the setting of the proof of Lemma [SI recalling all hypotheses and notation defined there. 
We first state some required estimates, which we use to deduce pT|) and (|18p . then we include the necessary 
proofs in Appendix [K\ 

Lemma 7. If QiS) > log TV and a ^ a/q + ^ with q < {qoQ{6)^y^ , (a, (?) = I, and |^| < {qaQ{5f)'^/L, 
then 

SMia) = :^G(a,q) /'V-x(r.)(dx)''-i)e2-"''(-)^dx + 0(Me-^^'^^vW), 

Ji 

where 

9-1 
£=0 

(rd+<i£,g) = l 

Lemma 8. IfQ{6) < \ogN and a = a/q + P with q < {qo{\ogN)^y^ , (a, g) = 1, and |^| < (qoilogN^)^ /L, 
then 



SM{a) = ^^G(a, g) / e^^^'^'^^^f^dx + 0{Me-^^^). 
(j){qd) 



M 
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In Appendix 1X1 we exhibit how Lemma [7] follows from Lemma and Lemma [5] follows from Lemma [T] in an 
analogous, more standard way. 



Lemma 9. Suppose g(x) = ao + aix + • • • + akX G Z[a;]. // W, 6 G Z, q G N and (a, q) = 1, then 

9-1 j^/j, 
(20) I « (gcd(cont(g),gi)gcd(afc,g2)) g'-'/^ 

(Wl+b,q) = l 

where q — qiq2, 92 *s ^^e maximal divisor of q which is coprime to W, and 

cont(.g) := gcd(ai, . . . ,afe). 

The statement of Lemma [5] indicates that we could lose control of the sum G(a, q) if the coefhcients of the 
auxiliary polynomials hd share larger and larger common factors. The following observation of Lucier ensures 
that this does not occur. 



Lemma 10 (Lemma 28 in 'TT). For every d eN, 

contihd) < |A(/i)|('=-i)/2^ont(/i), 

where A(h) — a^*^^^ ni5^j('^i ~ Qfj)'^'^^ */ ^ factors over the complex numbers as a{x — ai^^ . . .{x — UrY'' 
with all the ai 's distinct. 

While the statement of Lemma [TU] is pleasingly precise, we only use that cont{hd) is uniformly bounded in 
terms of the original polynomial h. 

Corollary 11. If{a,q) — 1, then 

|G(a,q)|«gl-l/^ 
where the implied constant depends only on h. 

5.1. Proof of (|17|). We treat the case of Q{S) > \ogN using Lemma [71 and the other case follows in a 
similar, slightly simpler fashion from LemmaO Since r]~'^ < Q{S), the hypotheses of Lemma[7]are certainly 
satisfied whenever a € Mg with q < ri~'^ . Therefore, Lemma [7] and Corollary 111 [ combined with the bound 

(21) / (l-x(r<i)(da;)''-i)e2"''''(^)'5dx < M ~ x{rd){dMY / dp <i 



from Lemma [2] and the well known facts (j){qd) > (j){q)(f){d) and 

4>{q) > c^q^^^ for any /i > 0, 

yield 



\SM{a)\ < g-'/^* + 0{Me-^^ c^^T^y 

Finally, the lower bound 



(22) * > A//go > Me-^'^''^^^°^ 

given by ^ and (O ensures that the error term is negligible, and the estimate follows. □ 

For our minor arc estimate we need the following analog of Weyl's Inequality, due to Li and Pan, which 
generalizes work of Vinogradov. 

Lemma 12. Suppose g{x) — ao + aix + • • • + Okx'' e l\x\ with Ofc > 0, £), G N, and b E Z. If U > logD, 
Ok > |afc_i| H h |ao|, and W, \b\,ak < U'', then 

° D 

logiWx + 5)e2'^*f (^i;)" < — + U^D^^" 

Wx+bev 

for some constants C = C{k) and c = c{k) > 0, provided 

\a-a/q\<q^'^ for some U'^ < q < g{D)/U^ and {a,q)^l. 

Lemma 1121 is a rougher, only nominally generalized version of Theorem 4.1 of TO . That result restricts 
to the case where [/ is a power of log_D, and provides a more precise bound in place of K, but the main 
achievement of the theorem is that one can take U to be that small. Larger values of U, and hence stricter 
conditions on q, actually make the proof, which can be found in the appendix of that paper, slightly easier. 
Specifically, one can observe that the precise condition on q is not utilized until Lemmas 4.11 and 4.12, and 
adaptations of those lemmas are sufficient to adapt the proof of Theorem 4.1. 
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5.2. Proof of (|18p . Again, we only treat the case Q{5) > logiV. For a fixed a G m, we have by the 
pigeonhole principle that there exist 

l<q< L/iqoQ{5ff 

and (a, q) = I with 

\a-a/q\ < {q^Q{5f)^l{qL). 

If r]^^ < q< {qoQ{^)^)^ , then a meets the hypotheses of Lemma[71 and by reasoning identical to the proof 
of ini) we have 

If (goQ(<5)^)^ < g < L/{qoQ{Sf )'^ , then we can apply Lemma [T2l with U = goQ(<5)^, along with ^ and the 
fact that 7] > Q{5)^^ to conclude 

M ^ 
\SM{a)\ < ^^7^-2 < -—-2 < 77^-, 

as required. 

If 1 < 9 < '/ then, letting /? = a — a/g, it must be the case that 

(23) |/3| > rVL. 

as otherwise we would have a G 971. By Lemma [7] it suffices to show 



M 



From ([231) and Lemma 2.8 of [22], we have for any a; > 1 that 



hence by integration by parts. Lemma [21 and ([8]) we see 

I'M 

/ (1 -x(»'d)(rfa;)''-i)e2'^*'*'*(=")'3dx -qiM - x{rd){dMY /d) 

V dp 

and the estimate is complete. 



2(1 -p)Af < f]^, 



Appendix A. Exponential Sum Estimates: Proof of Lemma [3 Lemma [H and Corollary [TT] 

A.l. Proof of Lemma[7[ Fixing q < {qoQ{^)^)^ and (a, q) — 1, we first investigate the values of Sx{a/q) 
for X > A^i/iofc, We see that 



(24) 

and we note that 
(25) 



X q-l X 

Sx{a/q) = ^ i.(a;)e2^*''''(^)'^/« = ^ ^2n^h,{e)a/y ^ ^^^^^ 



2;=£ mod q 



v{x) ^ (j){d)i;{dX + rd, + dt, qd)/d. 



x=l 
x~i mod q 



Since {rd,d) — 1, we have that (r^ + d£, qd) = 1 if and only if (r^ + d£,q) = 1. Therefore, if (r^ + d£,q) > 1, 
we have V'(rf-'f + fd, rd + d£, qd) < log(dX + r^) <C logX, whereas if (r^ + d£, q) = 1, we have by ([25[) and 
Lemma [5] that 

^ 0(d) 



(26) 



a:=.^ mod q 



<t>{qd) 



X- 



x{rd)(dX)P /dp^ + 0{Xe~^"''^'^''/^) 
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Combining ([M]) and (P^ . we have 

for all X > N^/^°^ . In particular, since q < g^^'^'^iViogN ^^^^ jyj ^ N^^^'', we can apply trivial bounds for 
small values of X and conclude 

(27) Sx{a/q) = ^G(a,q)(x -x(rd)(rfX)7dp) +0(Afe-i"^'^^vW) 

for all X < M. 

Now suppose a — a/q + (3 with |/3| < {qaQ^S)"^)^ / L. By (P7|) and two applications of integration by parts, 
we see 

M 

SM{a) ^Y^„l^x)e^^il^^i^)'^/l^2^'^h,{x)l3 
x = l 



M 



SM{a/q)e^^'''-^^'^^ - J S^{a/ q)2ml3h'^{x)e^"'''^''^^ dx 



+ 0((1 + /3L)Me-i°^''=i^''^°s^) 



m 

4>{qd) 



G{a,q) J (l-x(?-d)(rfa;)''-ije2'^*''''(^)''da; + 0(Afe-5^f''=i^''^°8^), 



and the asymptotic is established. 

□ 



A. 2. Proof of Lemma [9j Fix g,W,b,a,q as in Lemma IHl We primarily make use of the well-known 
complete Gauss sum estimate 

9-1 

(28) I J2 e'"»(')"/« « gcd(cont(5), q)l/'=<zl-l/^ 

e=o 

which can be found for example in Lemma 6 of 12 . As is often the case with this type of sum, we can 
simplify our argument by taking advantage of multiplicativity. Specifically, it is not difficult to show that if 
q = qiq2 with (91,92) = l, then 

g-l iji-1 ga-l 

e=o ^1=0 ^2=0 

{We+b,q) = l {Wei+b,qi) = l {Wi2+b,q2) = l 

where a/q = ai/qi + 02/92, so we can assume without loss of generality that q — for some p (1 V, j G N, 
and separately consider the cases of p | and p\W. 

If p I and p I h, then Wt + 6 is never coprime to p^ , so the sum is clearly zero. If p | and p \ then 
Wi + 6 is always coprime to p' , so the sum is complete and the result follows from (|28|) . 
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If p -j" W, then p I W£ + 6 if and only if £ = ~bW ^ mod p. Therefore, 



(29) 



2Tzig{j)r+7n)a/ p-' 



1=0 
p\Wl+b 



1=0 



r=0 



where m = —bW ^ mod p, and by (j28p we need only obtain the estimate for the second sum. 
Setting 



g{pr + m) — .g(m) 



P 



we see that g is a polynomial with integer coefficients and leading coefficient akP^ ^ ■ In particular, 

gcd(cont(5),p^"^) < p^^'^ gcd(afe,p-'"^). 

Therefore, by we have 



E 



J2TTig{pr-\-m)a / p-' 



E 



o27ri(g(pr+m)-g(m))a/p^ 



^^■^ia{r)a/p' 

r=0 



as required. 



< gcd(afc,pJ)'/V 



□ 



A. 3. Proof of Corollary llll From its definition, we see that the leading coefficient of hd is d'^b/X{d), 
where b is the leading coefficient of h. Given g G N and {a,q) — 1, we write q — qiq2, where q2 is the 
maximal divisor of q which is coprime to d. In particular, 

(30) gcd{d''b/X{d),q2)<b. 

Therefore, by Lemma [9l Lemma [TOl and ((30|) we have 

9-1 



\G{a,q)\ 



E 



^2nih^(i)a/q 



e=o 

{ra+de,q) = l 



« (gcd(cont(/ld),gi)6)'^%l-l/'= « q^-'/'' 



and all the required estimates are established. 



□ 



Appendix B. Theorems [2] and [3j An Informal Discussion 

Using the result of Theorem [4l one can almost immediately conclude Theorem [3] by replicating the 
transference principle argument used in ^8^ to obtain Theorem F from a uniform version of Theorem C. 
Similarly, using weighted analogs of the major and minor arc estimates from this paper, one can almost 
immediately conclude Theorem [2] by reproducing the method of [6] used to prove Theorem D. In each 
instance, there are a few issues that arise and we address in this section, which is best read in conjunction 
with those two papers. First, we recall that for the arguments in 6 and [8 , it is convenient, if not necessary, 
to do analysis with a discrete frequency domain, that is to embed subsets of [l,iV] into the finite group 
Z/NZ as opposed to the integers. 
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B.l. Higher Moments of Weyl Sums. To adapt the methods of [B] and [5], we need analogous estimates 
on higher moments of weighted and unweighted exponential sums over polynomials in primes. Specifically, 
if we borrow some notation from Section [5] and define 

^(") = ^ E ^d(x)e2-''''(-)" and W{a) = ^ E ^d{xmx)e^-^'^^^'^^- , 

then it is straightforward to apply the major and minor arc estimates from this paper, weighted analogs 
thereof, and higher moment estimates on standard Weyl sums (see [24] for example) to conclude under 
appropriate conditions that 

E \T{t/N)\'^Nf |r(a)|Ma<l and ^ \W{t/N)\' ^ N f \W{a)\'da^l. 

t(£Z/NZ ° teZ/NZ " 

It is with these estimates in mind that we chose 5 = 2'^' + 6, although something much smaller would suffice, 
and the above equalities follow from the dependence on s in the definition of Hd, as the relevant mod N 
congruences imply equality. 



B.2. Applying Lemma [2] to Theorem [2l Because the method of 6 does not involve estimating the 
number of solutions to the desired equation, it suffices for the proof of Theorem [2] to use a simplified form of 
Lemma[2]in which Q{S) is replaced with e'^^^'"^^ throughout. In order to obtain a usable analog to Lemma 1 
of 0, we need to initially pass to a subprogression of step size X{qo) and replace the condition d < iV "^ with 
^ < gcviog N £qj, sufficiently small constant c. This requires us to replace the concentration upper bound 
cr^(log A'')"-'^"'"'^ with (T^(log -/V)~ 2+*^, which is the reason for the factor of 2 discrepancy between Theorem D 
and Theorem [2j 



B.3. "Square Root Cancellation" in Theorem [2l The proof of Theorem D intimately uses the fact 
that for a quadratic polynomial, the normalized, weighted Weyl sum has "square root cancellation" on the 
major arcs. In our setting, we can apply weighted analogs of Lemmas [7] and |8] to conclude under appropriate 
conditions that if t/N is close to a rational a/q with (a, q) = 1, then 

Wit/N) < ^ min{l, {N\t/N - a/q\)-^} < q-^/^ log log g min{l, {N\t/N - a/q\y^}, 

where W is as in Section IB. II and the last inequality is a standard estimate on (j). While this is not quite as 
good as the estimate used in the proof of Lemma 2 of , the error of log log q can easily be absorbed with 
other negligible terms as in the end of that proof (in fact log q would be fine as well) . For a more detailed 
proof of Theorem [21 see [16] . 

B.4. Rephrasing Theorem |4] to Deduce Theorem [3l Theorem |4] implies the following, less precise 
statement, which uses notation defined in Section [2] and is ready-made for applying a transference principle. 

Theorem 5. If h Cz 7L\t\ is a V -inter sective polynomial and F : Z/NZ — > [0, 1] with 

x&LjNTL 

then there exist constants c(h,S) > and NQ(h,S) such that 

^ E F{x}F{x + h4y))My)>c{h,S) 

provided d < logiV and N > No{h, S). 



Once armed with Theorem [5] and the unweighted higher moment estimate from Section IB. II Theorem |3] 
follows in the identical fashion that Theorem F follows from a uniform version of Theorem C, as in [8]. 
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